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Abstract 

We show an interesting connection between non-standard (non Boltzmannian) distri- 
bution functions arising in the theory of violent relaxation for collisionless stellar systems 
(Lynden-Bell 1967) and the notion of superstatistics recently introduced by Beck & Cohen 
(2003). The common link between these two theories is the emergence of coarse-grained 
distributions arising out of fine-grained distributions. The coarse-grained distribution 
functions are written as a superposition of Boltzmann factors weighted by a non-universal 
function. Even more general distributions can arise in case of incomplete violent re- 
laxation (non-ergodicity). They are stable stationary solutions of the Vlasov equation. 
We also discuss analogies and differences between the statistical equilibrium state of a 
multi-components self-gravitating system and the metaequilibrium (or quasi-equilibrium) 
states of a collisionless stellar system. Finally, we stress the important distinction between 
entropies, generalized entropies, relative entropies and //-functions. We discuss applica- 
tions of these ideas in two-dimensional turbulence and for other systems with long-range 
interactions. 

1 Introduction 

Recently, several researchers have questioned the "universality" of the Boltzmann distribution 
in physics. This problem goes back to Einstein himself who did not accept Boltzmann's prin- 
ciple S = khaW on a general scope because he argued that the statistics of a system (W) 
should follow from its dynamics and cannot have a universal expression PJI2]. In 1988, Tsallis 
introduced a generalized form of entropy in an attempt to describe complex systems jSj- This 
was the starting point for several generalizations of thermodynamics, statistical mechanics and 
kinetic theories (see, e.g., [4 ). A lot of experimental and numerical studies (in an impressive 
number of domains of physics) has then shown that complex systems exhibit non-standard dis- 
tributions and that, in many cases, they can be fitted by Tsallis g-distributions However, 
there also exists physical systems (like those that we shall consider here) that are described 
neither by Boltzmann nor by Tsallis distributions. 

An important question is to understand why non-standard distributions and generalized 
entropies emerge in a system. We have argued that non-standard distributions arise when 
microscopic constraints are in action [Oj. They sometimes appear as "hidden constraints" in- 
accessible to the observer. For "simple systems", the energetically accessible microstates are 



equiprobable and a standard combinatorial analysis leads to the Boltzmann entropy. Then, 
the equilibrium distribution (most probable macrostate) maximizes the Boltzmann entropy at 
fixed macroscopic constraints (mass, energy,...). For "complex systems", the a priori accessible 
microstates are not equiprobable, some being even forbidden, contrary to what is postulated 
in ordinary statistical mechanics. The non-equiprobability of microstates can be due to micro- 
scopic constraints (of various origin) that affect the dynamics. In certain cases, the microscopic 
constraints can be dealt with by using a generalized form of entropy. In principle, this en- 
tropy S = In W should be obtained from a counting analysis by assuming that the microstates 
which satisfy the macroscopic constraints and the microscopic constraints are equiprobable. 
An example of microscopic constraints is provided by the Pauli exclusion principle in quantum 
mechanics which prevents two fermions with the same spin to occupy the same site in phase 
space. Because of this constraint, the Boltzmann entropy is replaced by the Fermi-Dirac en- 
tropy which puts a bound /(x, v) < rjo on the maximum value of the distribution function. In 
this example, the exclusion principle is explained by quantum mechanics so it has a fundamen- 
tal origin. Another example is when the particles are subject to an excluded volume constraint. 
In simplest models (e.g., a lattice model), this is accounted for by introducing a Fermi-Dirac 
type entropy in physical space which puts a bound p(x) < cr on the maximum value of the 
spatial density. These entropies can be obtained from a combinatorial analysis which carefully 
takes into account the fact that two particles cannot be in the same microcell in phase space 
or in physical space. More generally, we can imagine other situations where some microscopic 
constraints (not necessarily of fundamental origin) act on the system and lead to non-standard 
forms of distribution functions and entropies. 

Non-Boltzmannian distributions can also emerge when the system does not mix well (for 
some reason) so that the evolution is non-ergodic. In that case, the system does not sample 
the a priori energetically accessible phase space uniformly and prefers some regions more than 
others. The effectively accessible phase space can have a complicated geometrical structure. In 
many cases, we do not know the nature of the microscopic constraints perturbing the dynamics, 
so that they act as "hidden constraints" inaccessible to the observer. We just see their effect 
indirectly because they lead to non-standard distributions. The fact that we do not know these 
microscopic constraints implies an indetermination in the selection of the entropy functional. 
For example, the Tsallis entropies [3] can be relevant for a certain type of non-ergodic behaviour 
when the phase space has a fractal or multifractal structure. This is appropriate in particular 
for porous media and in the case of weak chaos. In Tsallis generalized thermodynamics, the 
complexity of mixing is encapsulated in a single parameter q which indexes the entropies and 
characterizes the degree of mixing (q = 1 if the evolution is ergodic). In some cases, it is possible 
to determine the parameter q directly from the microscopic dynamics. In more complicated 
situations, it has to be adjusted to the situation by a fit. It would be interesting to obtain Tsallis 
form of entropy directly from a counting analysis by assuming that the energetically accessible 
microstates are equiprobable on a fractal phase space. In that case, Tsallis entropy could be 
viewed as an entropy on a fractal. One interesting aspect of Tsallis entropy is that it exhibits 
mathematical properties very close to those possessed by the Boltzmann entropy. Therefore, 
it represents the most natural extension of the Boltzmann entropy to the case of "complex" 
systems. However, Tsallis entropy is not expected to describe all types of complex systems. 
Depending on the constraints acting on the underlying dynamics, there exists situations in 
which the observed distribution differs from a g-distribution. In that case, we must consider 
more general forms of entropy S = — J C(f)dxdv where C(f) is a convex function jH]. 

Several microscopic models have been constructed to show how non-standard distributions 
and generalized entropies can emerge in a system. By introducing a kinetic interaction prin- 
ciple (KIP), Kaniadakis [7] has obtained a generalized form of Boltzmann and Fokker-Planck 
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equations that lead to a wide class of distribution functions at equilibrium. These general- 
ized equations arise when the expression of the transition probabilities is more general than 
usually considered. This can take into account quantum statistics or non-ideal effects (e.g. 
excluded volume) that are ignored in the standard derivation of the Boltzmann and Fokker- 
Planck equations. On the other hand, Borland 8j and Chavanis [6 J have introduced generalized 
stochastic processes and generalized Fokker-Planck equations in which the diffusion coefficient 
and the friction/drift terms explicitly depend on the concentration of particles. The dynamics 
of particles described by these stochastic processes has a complex (non-ergodic) phase space 
structure. These equations lead to non-standard distributions at equilibrium and they are as- 
sociated with generalized free energy functionals which play the role of Lyapunov functions. 
Generalized Fokker-Planck equations have also been studied by Frank [Oj. In fact, as discussed 
in Chavanis [6 , it is possible to generalize the usual kinetic equations (Boltzmann, Landau, 
Kramers, Smoluchowski,...) in such a way that they satisfy a H-theorem for an arbitrary form 
of entropy. Boltzmann, Fermi-Dirac, Bose-Einstein and Tsallis entropies are just special cases 
of this general formalism. As indicated previously, the generalization of standard kinetic mod- 
els can be viewed as a heuristic attempt to take into account "hidden constraints" in complex 
systems. What we are doing, essentially, is to develop an effective thermodynamical formalism 
(E.T.F.) to accommodate from our lack of complete information on the microscopic dynamics 
of a complex system. 

In a different context, Beck & Cohen [TO] have shown how non-standard distributions can 
arise in a system if an external variable (e.g. the temperature) is allowed to fluctuate. The 
probability of energy E is then given by a Laplace transform P(E) = J + °° f{(3)e~ fiE d(3 where 
f(P) is the distribution of fluctuations that must be regarded as given. When /(/3) is strongly 
peaked around a temperature (3q, the Boltzmann distribution P(E) = ^e~ ,3oE is recovered. 
Beck & Cohen gave particular examples of non-standard distributions P(E) arising from this 
formalism and Tsallis & Souza jTT] constructed the generalized entropies associated with these 
non-standard distributions. 

At the same time (ignoring the works of Beck & Cohen and Tsallis & Souza), we re- 
vived the concept of violent relaxation introduced by Lynden-Bell [TJ] for collisionless stel- 
lar systems described by the Vlasov-Poisson system and we showed how this theory predicts 
metaequilibrium states characterized by non-standard distribution functions jUJE]- Assum- 
ing complete relaxation (ergodicity), the coarse-grained distribution function (DF) is given by 
/(e) = -^H: J +o ° x(^) 7 ?e _ ' 7 ^ ,E+Q ' ) (ir7 where the function x(v) accounts for the conservation of the 
Casimir integrals and is determined by the initial conditions. In this context, the Casimir inte- 
grals play the role of "hidden constraints" because they are not accessible at the coarse-grained 
scale (which is the scale of observation). Due to the Liouville theorem in /i-space, they can give 
rise to an effective "exclusion principle" similar to the Pauli principle in quantum mechanics 
|12[ ITI] . In particular, the coarse-grained distribution is bounded by the maximum value of 
the initial (fine-grained) distribution: /(x,v, t) < max xv {/(x, v, t = 0)}. We gave partic- 
ular examples of non-standard distributions /(e) arising from this formalism, with emphasis 
on the Fermi-Dirac distribution and we introduced the notion of "generalized entropies" 
S[f] = — J C(f)dxdv (in /-space) associated with these coarse-grained distributions. The same 
ideas apply in two-dimensional (2D) turbulence where the coarse-grained vorticity is given by 
<^(V0 = zijf) I-oo x( cr ) cre_fT< ' /3 ^ +Q ' ] da O Uni HI! • In the case of geophysical flows that are forced 
at small scale, Ellis et al. [TH] interpret x( cr ) as a prior vorticity distribution encoding the 
statistics of forcing while for freely evolving flows x{ a ) is determined from the initial conditions 
by the Casimirs. In the point of view of Ellis et al. further discussed in Chavanis [T§] . 

the function x( a ) must be regarded as given and it directly determines the form of generalized 
entropy S\uJ] — — J C(uJ)dx (in cJ-space) associated with the coarse-grained vorticity field. The 
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small-scale forcing, encapsulated in the function x( cr ) ) can De viewed as a "hidden constraint" 
which affects the structure of the coarse-grained vorticity. 

The object of this paper is to emphasize the similarity between the Beck-Cohen superstatis- 
tics and the coarse-grained distributions arising in theories of violent relaxation. The point 
of superstatistics is that experimentally or numerically observed distributions are in general 
coarse-grained distributions which arise as averages of finer-grained distributions. Therefore, 
Lynden-Bell's statistics is a sort of superstatistics. This connection has not been noted pre- 
viously and we think that it deserves to be pointed out in detail. Furthermore, the notion of 
generalized entropies that we gave in [S| in the context of the theory of violent relaxation is 
similar to that given by Tsallis & Souza ^1] in relation with the Beck-Cohen superstatistics. 

The paper is organized as follows. We first start to emphasize the distinction between 
the statistical equilibrium state of a iV-stars system described by the Hamilton equations and 
the metaequilibrium states of a collisionless stellar system described by the Vlasov equation. 
To stress the analogies and the differences, we consider a stellar system with a distribution 
of mass. The statistical equilibrium state is described in Sec. |2] and the theory of violent 
relaxation is discussed in Sec. 13.21 The similarities (and differences) between coarse-grained 
distribution functions and superstatistics is shown in Sec. 13.41 We introduce the notion of 
generalized entropy S[f] associated with the coarse-grained distributions in Sec. 13.31 We show 
that the generalized entropies associated with the coarse-grained DF predicted by Lynden- 
Bell can never be the Tsallis functional S q [f] = — j(f q — f)drdv because Lynden-Bell's 
distribution is defined for all energies e while Tsallis g-distribution (with q > 1) has a compact 
support (the distribution function drops to zero at a finite energy). Then, in Sec. 13. 5^ we insist 
on the notion of incomplete violent relaxation and on the limitations of Lynden-Bell's statistical 
prediction. As the fluctuations weaken as the system approaches equilibrium, it can be trapped 
in a stationary solution of the Vlasov equation which is not the most mixed state. We interpret 
Tsallis functional S q [f] as a particular if -function in the sense of Tremaine, Henon & Lynden- 
Bell [20J > n °t as an entropy. We show that the proper form of Tsallis entropy in the context of 
violent relaxation is a functional S q [p] = — f (p q — p)dr]drdv of the fine-grained distribution 
p(r,v,r/). The maximization of S q [p] at fixed mass, energy and Casimirs is a condition of 
thermodynamical stability (in a generalized sense). By contrast, the maximization of a H- 
function (e.g., the Tsallis if -function) at fixed mass and energy is a condition of nonlinear 
dynamical stability for a steady state of the Vlasov- Poisson system of the form / = /(e) 
with / (e) < H31 1211 • The if-functions can be used to construct a wide class of stable 
models of galaxies which can be an alternative to Lynden-Bell's prediction in case of incomplete 
relaxation. Another alternative is to develop a dynamical theory of violent relaxation fUJ |22] 
in order to understand what limits mixing. In that case, non-ergodicity is explained as a decay 
of the fluctuations of the gravitational field driving the relaxation, not by a complex structure 
of phase space. Generalized entropies like S q [p] or C(p) are not necessary in that approach. 
Finally, in Sec. we discuss these ideas in the context of 2D turbulence and show that the 
notions of prior vorticity distributions and relative entropies introduced by Ellis et al. ^H] 
make the analogies with superstatistics much closer than for freely evolving systems. 

2 Statistical equilibrium state of a multi-components stel- 
lar system 

We wish to determine the statistical equilibrium state of a stellar system made of stars with dif- 
ferent mass rrii. This Hamiltonian system is described by the microcanonical ensemble where the 
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energy E and the particle numbers N{ (for each species) are fixed. A thermal equilibrium state 
is established due to the development of stellar encounters which randomize the distribution of 
particles ( "collisional" mixing). Mathematically, this statistical equilibrium state is obtained 
when the infinite time limit t — > +00 is taken before the thermodynamic limit iV — > +00 
defined in j2Sl 121] • This statistical approach is adapted to the case of globular clusters whose 
age is of the same order as the Chandrasekhar relaxation time t re i ax ~ (iV/ln N)tu We 
shall determine the most probable distribution of stars at statistical equilibrium by using a 
combinatorial analysis, assuming that all accessible microstates (with given E and = A^m^) 
are equiprobable. To that purpose, we divide the /i-space {r, v} into a very large number of 
microcells with size h. We do not put any exclusion, so that a microcell can be occupied by 
an arbitrary number of particles. We shall now group these microcells into macrocells each of 
which contains many microcells but remains nevertheless small compared to the phase-space 
extension of the whole system. We call v the number of microcells in a macrocell. Consider the 
configuration {riij} where is the number of particles of species j in the macrocell i. Using 
the standard combinatorial procedure introduced by Boltzmann, the probability of the state 
{riij}, i.e. the number of microstates corresponding to the macrostate {n^}, is given by 

(1) W({ niJ }) = l[N : 



1,3 J 



This is the Maxwell-Boltzmann statistics. As is customary, we define the entropy of the state 
{riij} by 

(2) S({n tl }) = \nW({ niJ }). 

It is convenient here to return to a representation in terms of the distribution function giving 
the phase-space density of species j in the i-ih macrocell: = fj{v^ Vj) = nijUij/uh 3 . Using 
the Stirling formula Inn! = nlnn — n, we have 

(3) lnW/(K}) = krny = ~J2^t ln t- 

I 1 1 fj 1 1 ij 

i,j i,j J J 

Passing to the continuum limit v — > 0, we obtain the usual expression of the Boltzmann entropy 
for different types of particles 

(4) S B = - Y [ A i n A d 3 rd \ } 

J mi mi 

up to some unimportant additive constant. This is the expression used by Lynden-Bell & 
Wood 26J in their thermodynamical description of "collisional" stellar systems (globular clus- 
ters). Assuming ergodicity, the statistical equilibrium state, corresponding to the most probable 
distribution of particles, is obtained by maximizing the Boltzmann entropy (J3J while conserving 
the mass of each species 

(5) M, = //, d 3 rd 3 v , 
and the total energy 

(6) E = \ I f l,2d3rd3v + \ f P M * T 
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where /(r, v) = fi(r, v) is the total distribution function and p = J fd 3 v the total density. 
The gravitational potential is determined by the Poisson equation 

(7) A$ = AnGp. 

Introducing Lagrange multipliers and writing the variational principle in the form 

(8) 5S B - f35E - a i 6M i = °> 

i 

we get 

(9) fi = Aie-^+V. 
The total distribution function is therefore given by 



(io) / = E A * 



It is a superposition of Maxwell-Boltzmann distributions with equal temperature k B T = 1/(5 
and different mass m,. According to the theorem of equipartition of energy, the mean squared 
velocity of species i decreases with mass such that 



2x Je~ l3mi ^v 2 d 3 v 3k B T 



TYli 



Therefore, heavy particles have less velocity dispersion to resist gravitational attraction so they 
preferentially orbit in the inner region of the system. This leads to mass segregation. The effect 
of mass segregation can also be appreciated by writing the distribution function (JUJ) in the form 



(12) /^) = Q;[/;(e)r /r % 

where = Ai/A™" 1 ^™' 1 is a constant independent on the individual energy e = v 2 /2 + $. On 
the other hand, developing a kinetic theory for a multi-components self-gravitating system, one 
obtains the multi-species Landau equation 

/-i q\ dfi . dfi dfi d ^ f ( dfi r d fj\« , 

(13) — h V • — h r ■ — — = — > / iv 771,-f,-— TTLifi—^- \d V , 



(14) = 2vrG 2 -lnA 5^ - 



7J V M 2 



where u = v — v' is the relative velocity of the particles involved in an encounter, In A = 
J +o ° dk/k is the Coulomb factor (regularized with appropriate cut-offs) and we have set /j = 
fj(r,v',t) assuming that the collisions can be treated as local (see Kandrup for a critical 
discussion of this approximation and formal generalizations). The Landau-Poisson system 
conserves the total mass of each species of particles and the total energy of the system. It 
also increases the Boltzmann entropy (@J) monotonically: S B > (H-theorem). The linearly 
dynamically stable stationary solutions of the Landau-Poisson system are determined by the 
mean-field Maxwell-Boltzmann distributions Q which are local maxima of the Boltzmann 
entropy at fixed E, N iy so they correspond to statistical equilibrium states. We emphasize 
that the Boltzmann distribution is the only stationary solution of the Landau equation. The 
problems linked with the absence of strict statistical equilibrium state in self-gravitating systems 
and the notion of long-lived metastable states are discussed in [21] . 
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3 Violent relaxation of collisionless stellar systems 



3.1 The Vlasov-Poisson system 

We shall now contrast the statistical equilibrium state of "collisional" stellar systems (globular 
clusters) to the metaequilibrium, or quasi-equilibrium, states of "collisionless" stellar systems 
(elliptical galaxies). The distinction between collisional and collisionless dynamics is just a 
question of timescales. The age of elliptical galaxies is by many orders of magnitude smaller 
than the Chandrasekhar relaxation time so that their evolution is governed by the Vlasov- 
Poisson system 

(15) ^ + V '^ + F -^ = ' 



(16) A$ = 4ttG J fd\, 

where F = — V$ is the force by unit of mass experienced by a particle. Mathematically, the 
Vlasov equation is obtained when the N — > +oo limit is taken before the t — > +oo limit. 
Indeed, the collision term in Eq. (fTT?j) scales as 1/N in a proper thermodynamic limit 
so that it vanishes for N — > +oo. The Vlasov equation, or collisionless Boltzmann equation, 
simply states that, in the absence of encounters, the distribution function / is conserved by 
the flow in phase space. This can be written df/dt = by using the advective derivative. 
The Vlasov equation can also be obtained from the iV-body Liouville equation by making a 
mean-field approximation, i.e. the iV-body distribution factors out in a product of N one-body 
distributions. We note that the individual mass m ; of the stars does not appear in the Vlasov 
equation. Therefore, in the collisionless regime, the evolution of the total distribution function 
does not depend on how many species of particles exist in the system (unlike the Landau 
equation). This implies that the collisionless dynamics does not lead to a segregation by mass 
contrary to the collisional dynamics. It is easy to show that the Vlasov equation conserves 
the total mass M and the total energy E of the system. Furthermore, the Vlasov equation 
conserves an infinite number of invariants called the Casimir integrals. They are defined by 
Ih = J h(f)d 3 rd 3 v for any continuous function h(f). The conservation of the Casimirs is 
equivalent to the conservation of the moments of the distribution function denoted 

(17) M n = J f n d 3 rd 3 v. 

The Vlasov-Poisson system also conserves angular momentum and impulse but these constraints 
will not be considered here. Finally, the Vlasov equation admits an infinite number of stationary 
solutions whose general form is given by the Jeans theorem 

3.2 The metaequilibrium state 

The Vlasov-Poisson system develops very complex filaments as a result of a mixing process in 
phase space (collisionless mixing). In this sense, the fine-grained distribution function /(r, v, t) 
will never reach a stationary state but will rather produce intermingled filaments at smaller and 
smaller scales. However, if we introduce a coarse-graining procedure, the coarse-grained distri- 
bution function /(r, v,t) will reach a metaequilibrium state /(r, v) on a very short timescale, 
of the order of the dynamical time tp. This is because the evolution continues at scales smaller 
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than the scale of observation (coarse-grained). This process is known as "phase mixing" and 
"violent relaxation" (or collisionless relaxation) [23]. Lynden-Bell [T2| has tried to predict the 
metaequilibrium state achieved by the system in terms of statistical mechanics. This approach 
is of course quite distinct from the statistical mechanics of the iV-body system (exposed in Sec. 
I2J) which describes the statistical equilibrium state reached by a discrete iV-body Hamiltonian 
system for t —* +00. In Lynden-Bell's approach, we make the statistical mechanics of a field, 
the distribution function f(r,v,t) whose evolution is governed by the Vlasov-Poisson system, 
while in Sec. 121 we made the statistical mechanics of a system of point particles described by 
Hamilton equations. In the following, we shall summarize the theory of Lynden-Bell and make 
the connection with the notion of superstatistics. 

Let /o( r , v ) denote the initial (fine-grained) distribution function. We discretize /o( r ? v ) 
in a series of levels 77 on which fo(r,v) — V is approximately constant. Thus, the levels {77} 
represent all the values taken by the fine-grained distribution function. If the initial condition 
is unstable, the distribution function /(r, v, t) will be stirred in phase space (phase mixing) but 
will conserve its values 77 and the corresponding hypervolumes 7(77) = J S(f(r, v,t) — i])d 3 rd 3 v 
as a property of the Vlasov equation (this is equivalent to the conservation of the Casimirs). 
Let us introduce the probability density p(r, v, 77) of finding the level of phase density 77 in a 
small neighborhood of the position r, v in phase space. This probability density can be viewed 
as the local area proportion occupied by the phase level 77 and it must satisfy at each point the 
normalization condition 

(18) J p(r, v, 77)^77 = 1. 

The locally averaged (coarse-grained) distribution function is then expressed in terms of the 
probability density as 

(19) 70, v) = J p(r,v,r})r]dr}, 
and the associated (macroscopic) gravitational potential satisfies 

(20) A$ = 4ttG I /d 3 v. 



Since the gravitational potential is expressed by space integrals of the density, it smoothes out 
the fluctuations of the distribution function, supposed at very fine scale, so $ has negligible 
fluctuations (we thus drop the bar on <£>). The conserved quantities of the Vlasov equation 
can be decomposed in two groups. The mass and energy will be called robust integrals because 
they are conserved by the coarse-grained distribution function: M[f] = M\f] and E[f] ~ E[f]. 
Hence 

(21) M = I Jd 3 rd\, 



(22) E = J ^fv 2 d 3 rd 3 v + \Jf ®d 3 rd\. 

As discussed above, the gravitational potential can be considered as smooth, so we have ex- 
pressed the energy in terms of the coarse-grained fields / and <£> neglecting the internal energy 
of the fluctuations /$. Therefore, the mass and the energy can be calculated at any time of 
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the evolution from the coarse-grained field /. By contrast, the moments M n with n > 2 will 
be called fragile integrals because they are altered on the coarse-grained scale since f n ^ f ■ 
Therefore, only the moments of the fine-grained field M^ 9, = M n [f] = j f n d 3 rd 3 v are con- 
served, i.e. 

(23) Ml; 9 ' = J p(r, v, 7])r] n d 3 rd 3 vdr]. 

The moments of the coarse-g rained field M£ a -[f] = J f d 3 rd 3 v are not conserved along the 
evolution since M n [f] ^ M n [f]. In a sense, the moments M^ 9 - are "hidden constraints" be- 
cause they are expressed in terms of the fine-grained distribution p(r, v, 77) and they cannot be 
measured from the coarse-grained field. They can be only computed from the initial conditions 
before the system has mixed or from the fine-grained field. Since in many cases we do not 
know the initial conditions nor the fine-grained field, they often appear as "hidden". Note 
that instead of conserving the fine-grained moments, we can equivalently conserve the total 
hypervolume 7(77) = j pd 3 rd 3 v of each level 77. 

After a complex evolution, we may expect the system to be in the most probable, i.e. most 
mixed state, consistent with all the constraints imposed by the dynamics (see, however, Sec. 
13. 5|) . We define the mixing entropy as the logarithm of the number of microscopic configurations 
associated with the same macroscopic state characterized by the probability density p(r, v, 77). 
To get this number, we divide the macrocells (r, r + dv; v, v + dv) into v microcells of size h 
and denote by the number of microcells occupied by the level r\j in the z-th macrocell. Note 
that a microcell can be occupied only by one level rjj. This is due to the fact that we make 
the statistical mechanics of a continuous field f(r, v, t) instead of point mass stars as in Sec. 121 
Therefore, we cannot "compress" that field, unlike point-wise particles. A simple combinatorial 
analysis indicates that the number of microstates associated with the macrostate {n^} is 



(24) W({ nij }) = 11^11 



TJ ■ •' 



where Nj = J2i n ij * s the total number of microcells occupied by r]j (this is a conserved quantity 
equivalent to 7(77)). We have to add the normalization condition ^ . = u, equivalent to Eq. 
(|18|) which prevents overlapping of different levels (we note that we treat here the level 77 = 
on the same footing as the others). This constraint plays a role similar to the Pauli exclusion 
principle in quantum mechanics. Morphologically, the Lynden-Bell statistics (124)) corresponds 
to a 4 th type of statistics since the particles are distinguishable but subject to an exclusion 
principle There is no such exclusion for the statistical equilibrium of point mass stars 

since they are free a priori to approach each other, so we can put several particles in the same 
microcell. 

Taking the logarithm of W and using the Stirling formula, we get 

(25) In W({riij}) = - In ^ vh 3 p {j In p ii 

where pjj = p(r^, v*, 77^-) = 7iy /vh 3 . Passing to the continuum limit v — > 0, we obtain the 
Lynden-Bell mixing entropy 

(26) S l .b. [p] = ~ J p(r, v, 77) In p(r, v, r])d 3 rd 3 vdrj. 

Note that the Lynden-Bell entropy can be interpreted as the Boltzmann entropy for a distribu- 
tion of levels 77 (including 77 = 0). Equation (|26|) is sometimes called a collisionless entropy to 
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emphasize the distinction with the collisional entropy (jlj) of Sec. 121 Assuming ergodicity or "ef- 
ficient mixing" (which may not be realized in practice, see Sec. I3.5|) . the statistical equilibrium 
state is obtained by maximizing S[p] while conserving mass M, energy E and all the Casimirs 
(or moments M n ). We need also to account for the local normalization condition (|18|). This 
problem is treated by introducing Lagrange multipliers, so that the first variations satisfy 

(27) 5S - (35E - aJM n - j C(r, v)6 ( j p(r, v, V )dr,) d 3 rd\ = 0, 

n>l J \J J 

where (3 is the inverse temperature and a n the "chemical potential" associated with M n . The 
resulting optimal probability density is a Gibbs state which has the form 

(28) p(r,v, V ) = ^ X (v)e~ We+a)v , 

where e = y + $ is the energy of a star by unit of mass. In writing Eq. (J28)) . we have dis- 
tinguished the Lagrange multipliers a and (3 associated with the robust integrals M and E 
from the Lagrange multipliers a n> i, associated with the conservation of the fragile moments 
M n> i = f pr] n dr]d 3 rd 3 v, which have been regrouped in the function x(v) = exp(— J2n>i a n T T\ 
This distinction will make sense in the following. Under this form, we see that the equilib- 
rium distribution of phase levels is a product of a universal Boltzmann factor e _( - /3e+a - )?? by a 
non-universal function x{v) depending on the initial conditions. The partition function Z is 
determined by the local normalization condition j pdrj = 1 leading to 

(29) Z = [ x(v)e~ v(pe+a) dr). 



Finally, the equilibrium coarse-grained DF defined by / = J pr/dr/ can be written 

{ ' 1 Jx{v)e-^+^dr] ' 

or, equivalent ly, 

(31) J=~^f = E((3e + a)=J(e). 

It is straightforward to check that this coarse-grained distribution depending only on the energy 
e is a stationary solution of the Vlasov equation |2Sj- Thus, for a given initial condition, the 
statistical theory of Lynden-Bell selects a particular stationary solution of the Vlasov equation 
(most mixed) among all possible ones (an infinity!). Incidentally, the fact that the coarse- 
grained DF should be a stationary solution of the Vlasov equation is not obvious; this depends 
on the definition of coarse-graining, see j2H]- Specifically, the equilibrium state is obtained by 
solving the differential equation 



(32) A$ = AttG j f an , p (- + $Kv, 

and relating the Lagrange multipliers a n , (3 to the constraints M n , E. We note that the coarse- 
grained distribution function /(e) can take a wide diversity of forms depending on the function 
x(v) determined by the fragile moments ("hidden constraints"). Some examples will be given 
in Sec. IH.4I In the present context, the function x{v) is determined from the constraints a 
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posteriori. Indeed, we have to solve the full problem in order to get the expression of x{v)- 
In this sense, the constraints associated with the conservation of the fine-grained moments 
are treated microcanonically. We emphasize that the function /(e) depends on the detail of 
the initial conditions unlike in ordinary statistical mechanics where only the mass M and 
the energy E matter. Here, we need to know the value of the fine-grained moments M^ 9 ' 
which are accessible only in the initial condition (or from the fine-grained field) since the 
observed moments are altered for t > by the coarse-graining as the system undergoes a 
mixing process [M^ 9 ' 7^ ^l' 9 ')- This makes the practical prediction of /(e) very complicated, 
or even impossible, since we often do not know the initial conditions in detail (e.g., for the 
formation of elliptical galaxies). In addition, in many cases, we cannot be sure that the initial 
condition is not already mixed (coarse-grained). If it has a fine-grained structure, this would 
change a priori the prediction of the metaequilibrium state. 

We note that the coarse-grained DF predicted by Lynden-Bell depends only on the individ- 
ual energy e of the stars. According to the Jeans theorem [23] , such distribution functions form 
just a particular class of stationary solutions of the Vlasov equation, corresponding to spherical 
stellar systems (they even correspond to a sub-class of spherical systems whose general distri- 
bution function depends on energy e and angular momentum r x v). From this simple fact, it 
is clear that the statistical theory of violent relaxation is not able to account for the triaxial 
structure of elliptical galaxies. More general stationary solutions of the Vlasov equation can 
arise in case of incomplete violent relaxation and they differ from Lynden-Bell's prediction (see 
Sec. 13. 5|) . We also note that /(e) is a monotonically decreasing function of energy. Indeed, 
from Eqs. (j2Hjl and (pH|) . it is easy to establish that 

(33) f(e) = -0/ a , f 2 = J p(v ~ Ifdr) > 0, 

where f% is the centered local variance of the distribution p(r, v, 77). Therefore, /(e) < 
since (3 > is required to make the velocity profile normalizable. Finally, the coarse-grained 
distribution function satisfies f(r, v) < f™ ax where f™ ax is the maximum value of the initial 
(fine-grained) distribution function. This inequality can be obtained from Eq. (|3Uj) by taking 
the limit e — > — 00 for which /(e) — > r] max = f™ ax and using the fact that /(e) is a decreasing 
function. Of course, the inequality < / < f™ ax is clear from physical considerations since 
the coarse-grained distribution function locally averages over the fine-grained levels. Since 
the fine-grained distribution function is conserved by the Vlasov equation, the coarse-grained 
distribution function is always intermediate between the minimum and the maximum values of 
/q. Finally, we note that Lynden-Bell's distribution ([3"U|) does not lead to a segregation by mass 
since the individual mass of the particles does not appear in the Vlasov equation on which the 
whole theory is based; however, it leads to a segregation by phase levels 77. 

If the initial DF takes only two values /o = and /o = rjo, the statistical prediction of 
Lynden-Bell for the metaequilibrium state is 

(34) 7= ^ 

which is similar to the Fermi-Dirac distribution ^21 El- This has to be contrasted from the 
statistical equilibrium state (for t — ► +00) of the one component self-gravitating gas which is 
the Maxwell-Boltzmann distribution 

(35) / = Ae~ Pme . 

In the dilute limit of Lynden-Bell's theory / -C T] (which may be a good approximation for 
elliptical galaxies, see [12]), the DF ([Sljl becomes 

(36) / = A'e~^ 0€ . 
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This is similar to the statistical equilibrium state (|35j) of the iV-stars system. Therefore, in this 
approximation, collisional and collisionless relaxation lead to similar distribution functions (the 
Maxwell-Boltzmann distribution) but with a completely different interpretation, corresponding 
to very different timescales. To emphasize the difference, note in particular the bar on / in Eq. 

and the fact that the mass of the individual stars m in (J3*5|) is replaced by the value 770 of 
the fine-grained distribution function. 

3.3 Generalized entropies 

We have seen that the most probable local distribution of phase levels p(r, v, rj) maximizes 
the mixing entropy ()26|) while conserving mass, energy and all the fine-grained moments. This 
functional of p is the proper form of Boltzmann entropy in the context of violent relaxation. It is 
obtained by a combinatorial analysis taking into account the specificities of the collisionless evo- 
lution. We shall now show that the most probable coarse-grained distribution function /(r, v) 
(which is the function directly accessible to the observations) maximizes a certain functional 
S[f] at fixed mass M and energy E. This functional of / will be called a "generalized entropy" 
(in a sense different to that given by Tsallis). It is non- universal and depends on the initial 
conditions. It is determined indirectly by the statistical theory of Lynden-Bell and cannot 
be obtained from a combinatorial analysis, unlike S[p\. Such generalized (non-Boltzmannian) 
functionals arise because they encapsulate the influence of fine-grained constraints (Casimirs) 
that are not accessible on the coarse-grained scale. They play the role of "hidden constraints" 
in our general interpretation of non-standard entropies. We note that the entropic functionals 
S[p] and S[f] are defined on two different spaces. The p-space is the relevant one to make the 
statistical mechanics of violent relaxation JTR [TbT/ . The f -space is a sort of projection of the 
p-space in the space of directly observable (coarse-grained) distributions. 

Since the coarse-grained distribution function /(e) predicted by the statistical theory of 
Lynden-Bell depends only on the individual energy and is monotonically decreasing, it extrem- 
izes a functional of the form 



at fixed mass M and energy E, where C(f) is a convex function, i.e. C" > 0. Indeed, 
introducing Lagrange multipliers and writing the variational principle as 



Since C is a monotonically increasing function of /, we can inverse this relation to obtain 



(37) 





SS - (55E - aSM = 0, 



we find that 



(39) 



C"(/) = -/3e-a. 



(40) 




where 



(41) 




From the identity 



(42) 



— / 
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resulting from Eq. ()39|). /(e) is a monotonically decreasing function of energy (if (5 > 0). Thus, 
Eq. ()31|) is compatible with Eq. ()40|) provided that we use the identification ([4*T]) . Therefore, 
for any function F(x) determined by the function x{v) m ^ ne statistical theory, we can associate 
to the metaequilibrium state (|3"Tj) a generalized entropy (|3*Tj) where C(/) is given by Eq. (jUJ) 
or equivalently by 

r 7 

(43) C(/) = - / F-\x)dx. 

It can be shown furthermore that the coarse-grained distribution (J31|) maximizes this gener- 
alized entropy at fixed energy E and mass M (robust constraints) 1 . We note that C(f) is 
a non-universal function which depends on the initial conditions. Indeed, it is determined 
by the function xiv) which depends indirectly on the initial conditions through the compli- 
cated procedures discussed in Sec. 13.21 In general, S[f] is not the Boltzmann functional 
SbU] = — f f hi fd 3 rd 3 v (except in the dilute limit of the theory) due to fine-grained con- 
straints (Casimirs) that modify the form of entropy that we would naively expect. This is 
why the metaequilibrium state is described by non-standard distributions (even for an assumed 
ergodic evolution). The existence of "hidden constraints" (here the Casimir invariants that 
are not accessible on the coarse-grained scale) is the physical reason for the occurrence of non- 
standard distributions and "generalized entropies" in our problem. In fact, the distribution is 
standard (Boltzmann-Gibbs) at the level of the local distribution of fluctuations p(r, v, r/) (p- 
space) and non-standard at the level of the macroscopic coarse-grained field f(r, v) (f -space). 
We emphasize that the generalized entropies, which are maximized by the coarse-grained dis- 
tributions, are phenomenological in nature. The point here is that generalized entropies arise 
because we want to phenomenologically extend the maximum entropy principle at the level of 
coarse-grained distributions. 



3.4 Connection with superstatistics 

We would like now to point out some connections between coarse-grained distribution functions 
and superstatistics. Setting E = f3e + a, we can rewrite the "partition function" in the 
form 

r+oo 

(44) Z(E) = / X (v)e~ vE dr)- 

Jo 

This is the Laplace transform of x(v)- Therefore, the partition function Z(E) = x(E) can 
be used as a generating function for constructing the moments of the fine-grained distribution 
[23 EH]- The coarse-grained distribution is given by 

i r+oo 

(45) f(E) = J7^J x(v)ve- vE d V . 

We note that the Lynden-Bell statistics has a form similar to the superstatistics P(E) = 
Jo + °° f(.P) e ~ l3E d/3 of Beck & Cohen [TU] provided that we identify the distribution of temperature 
f(P) to the distribution of phase levels x(v)- Formally, the distribution P(E) is expressed as a 
Laplace transform like the partition function Z(E). However, physically, one should focus on 
the coarse-grained distribution (|4~5j) as being the superstatistics in the present context rather 

lr This implies that / is dynamically stable (nonlinearly) via the Vlasov-Poisson system. Our discussion 
implicitly assumes that the system is confined within a box so as to avoid the infinite mass problem (Sec. 13.51) . 
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than the partition function (J44j) . These coarse-grained distributions do not exactly have the 
form considered by Beck and Cohen, but this is a minor point. Super statistics is an idea 
foremost, not a proposition for a fixed form of average distribution. The real point is that the 
coarse-grained distributions do arise as averages (of some sort) of fine-grained distributions of 
Boltzmann's type and so are superstatistics. 

Due to these formal and physical analogies, we can transpose the results of Beck & Cohen 
[TU] to the context of violent relaxation. However, in the present case, the physical distribution 
is given by 



(46) 



f(E) 



dlnZ 
~dE~'' 



instead of P(E). Therefore, for the same f((3) and x(v)j the distributions P(E) and f(E) will 
differ because of this logarithmic derivative. In addition, we must require that the distribution 
f(E) is integrable, i.e. the spatial density p = f fdv must exist. We note finally that the 
generalized entropy associated to the coarse-grained distribution f\E) is determined by the 
relation 



(47) 



C'(f) 



-E, 



where the function / = f(E) is specified by Eq. (J4I3J) depending on x(v)- Therefore, C(f) is 
obtained by inverting the relation / = — (In Z)'(E) and integrating the resulting expression. In 
mathematical terms, we get the nice formula defining the generalized entropy 



(48) 



C(f) 



i 



[(\nx)r\-*)dx. 



Interestingly, the notion of generalized entropy that we gave in the context of violent relaxation 
in [Oj is similar to the one given independently by Tsallis & Souza |TIJ in the context of 
superstatistics and by Almeida in the context of generalized thermodynamics [HI]. Let us now 
consider particular examples similar to those given by Beck & Cohen ^U] • These examples are 
given essentially to illustrate the fact that different forms of non-standard distributions can 
emerge on the coarse-grained scale. We do not claim that they have any particular physical 
meaning (except (ii)). Furthermore, many other examples of distributions and generalized 
entropies could be constructed. 

(i) Uniform distribution: We take xiv) = 1/6 for < i] < b and \ = otherwise. Then 



(49) 
and 



Z(E) = -{l- e 



-bE\ 



(50) 



This distribution satisfies f(E) — > b for E 
Since / ~ v ~ 2 for v — > +oo, the density p - 
(ii) 2-levels distribution: We take xiv) 



bE' 



E 1 - e 

-> -oo, 7(0) = 6/2 and J(E) ~ E' 1 for E 
J fdv exists only in d = 1 dimension. 
= \8{rf) + \8{r]-b). Then 



+oo. 



(51) 
and 
(52) 



Z(E) = ~(l + e-»*), 



f(E) = j 



+ e 



bE ' 
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This is similar to the Fermi-Dirac distribution ^21 El- We have f(E) — > 6 for E —> — oo, 

v 2 

/(0) = 6/2 and /(-E 1 ) ~ e -6 ^ for E — * +00. Since / ~ e _fc ^~ for v — > +00, the density 
p — J fdv exists in any dimension. Inverting the relation we get 



(53) -£ = ~ 



After integration, we obtain 



ln/-ln(6-/) 



C'(f). 



(54) .[ 7 ]^-/{{ln{ + (l-{)ln(l-{)}^3 V; 

which is similar to the Fermi-Dirac entropy. Note that for this two-levels distribution, the 
generalized entropy (|54|) in /-space coincides with the mixing entropy (|26j) in p-space since 
p(r,v,T]) = (/ /b)5(rj — b) + (1 — f /b)5(rj). This is because the distribution of phase levels 
p(r, v, 77) = p(r,v)5(ri — 6) + p'(r, v)8(j]) can be expressed in terms of the coarse-grained dis- 
tribution function f(r, v) = p(r, v)6, using the normalization condition p + p' = 1. This is the 
only case where we have the equivalence between the mixing entropy S[p] and the generalized 
entropy S[f}. The fact that the 'averaged' Shannon entropy (j2T?j) and the generalized entropy 
(137)) are different in general has also been noted by Beck in a different context. 
(in) Gamma distribution: We take 

in P -v/b 



with c > and b > 0. Note that the case c = 1 corresponds to the exponential distribution 
while b — > +00 corresponds to a power law. Then 

(56) Z(E) = (1 + bE)' c . 

As noted by Beck Sz Cohen ^21, this is similar to Tsallis g-distribution (with q < 1). However, 
in our context, the physical distribution is 

(57) J(E) 



1 + bE 



It is defined only for E > —1/6. Furthermore, f(E) ~ cE 1 for E — > +00 so that the spatial 
density exists only in d — 1 dimension. Inverting the relation (|57|) . we get 

1 / c6 

(58) -£ = l(l-^ 

After integration, we obtain 



6 v 7 



(59) cU) = {-c\nf. 

Note that the first term can be absorbed in the Lagrange multiplier a associated with the mass 
conservation so that the relevant generalized entropy is 

(60) S[f}= [ ln7d 3 rd 3 v. 
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It could be called the log-entropy. Note that when E = v 2 /2, the corresponding distribution 
function (|57|) is the Lorentzian. In a sense, the log-entropy can be viewed as a continuation of 
Tsallis entropy for q = (see Eq. (J7UJ)). This suggests to introducing the modified functional 

(61) ^/] = -^|Qr-/)rf 3 rd 3 v, 

which has properties similar to the Tsallis functional for q ^ and which reduces to Eq. (|60|). 
leading to the distribution (|57ji. for g — > 0. More precisely, S^/] = f In /<i 3 r<i 3 v + ^ + 0(q) 
for g — > 0, where if is a constant. Taking the variations SS q at fixed mass and energy leads to 
/ = (1 — (q — l)^) 1 /^ 1 ) which passes to the limit for g — f 0, unlike Eq. (|7Uj) . 

(iv) Gaussian distribution: We take 

(62) x(77 ) = 2^yv^ 

with 7 > 0. Then, 

(63) Z(E) = e^erfcf J. 
The corresponding coarse-grained distribution can be written 



(64) 



/(£) = _LhY JL ] i/(x) = x{^ — L____ii. 

v/7 V 2 ^/ I 0Fze* a erfc(a:) J 



This distribution satisfies /(£') ~ — ^ for £/ — > — oo and f{E) ~ -E 1-1 for E — > +oo. The 
density exists only in (i = 1 dimension. 

In the examples considered above, only the Fermi-Dirac distribution function is relevant 
for self-gravitating systems since the density p = J fdv is not defined for the others in d = 3 
dimensions. However, these examples may still be of interest in physics because the theory of 
violent relaxation is valid for other systems with long-range interactions described by the Vlasov 
equation (221 • The foregoing distributions may thus be relevant for one- dimensional systems. 
They can also be relevant in 2D turbulence (see Sec. 0} where the energy e = v 2 /2 + $(r) is 
replaced by the stream function ip(r), so that there is no condition of normalization equivalent 
to J fdv < oo. 

The E~ l behaviour of f(E) for E — > +oo arises because we have assumed that the function 
x{v) i s regular at r] = 0. In fact, the level rj = plays a particular role in the theory because it 
corresponds to the "vaccum" which has a very large phase space extension and which can mix 
with the non-zero levels. Therefore, we expect that x(v) ~^ Xo${v) f° r V ~ > 0- As a consequence, 
the level rj = should be treated specifically, and a more physical form of partition function, 
which isolates the contribution of r] = 0, would be 

P+OO 

(65) Z{E) = 1 + / X (v)e- vE dri, 



where a > 0. Note that we can take Xo — 1 without restriction of generality so that the value 
of 7(0), which is infinite, never appears in the theory. If we reconsider example (i) with now 
X = 1/(6 — a) for a < rj < b and x — otherwise, we get 

p -aE _ -bE 1 rp( np -aE _ u -bE\ 

(66) 7(E) = 



E[E(b -a) + e~ aE - e~ bE ] 
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If a 7^ (gap), the DF decreases as / ~ a/(b — a)E~ 1 e~ aE for E — > +00 and if a = (no gap) 
as / ~ (l/b)E~ 2 . The density profile p = J fdv is now well-defined in d = 3. If we reconsider 
example (ii) with we get 

(67) 7(/--') 



1 + &£)[(! + 6£) c + l]' 



which decreases as / ~ cb C E ( c+1 ). We think that the particularity of the level 77 = is an 
important point that deserves further consideration. 



3.5 Incomplete relaxation, Tsallis entropies and iif-functions 

The statistical approach presented previously rests on the assumption that the collisionless 
mixing is efficient so that the ergodic hypothesis which sustains the statistical theory is fulfilled. 
In reality, this is not the case. It has been understood since the beginning that violent 
relaxation is incomplete so that the mixing entropy (j2fi|l is not maximized in the whole phase 
space and real stellar systems are not described by Lynden-Bell's statistics. In fact, for stellar 
systems, violent relaxation cannot be complete because there is no maximum entropy state 
in an unbounded domain. The generalized isothermal distribution functions (|30j) predicted 
by Lynden-Bell, when coupled to the Poisson equation, yield density profiles whose mass is 
infinite (the density decreases as r~ 2 at large distances). But this mathematical problem is 
rather independent from the physical reason why violent relaxation is incomplete. Physically, 
real stellar systems tend towards the maximum entropy state during violent relaxation but 
cannot attain it because the gravitational potential variations die away before the relaxation 
process is complete. Thus, for dynamical reasons, the system will not explore the whole phase 
space ergodically as discussed in fHJ 122] • However, since the Vlasov equation admits an infinite 
number of stationary solutions, the coarse-grained distribution / can be trapped in one of them 
and remain frozen in that state until collisional effects come into play (on longer timescales). 
This steady solution is not, in general, the most mixed state (it is only partially mixed) so it 
differs from Lynden-Bell's statistical prediction. The concept of incomplete violent relaxation 
explains why galaxies are more confined than predicted by statistical mechanics (the density 
profile of elliptical galaxies decreases as r~ A instead of r~ 2 |25j). 

In order to quantify the importance of mixing, Tremaine, Henon & Lynden-Bell j2H| have 
introduced the notion of if-functions. They are defined by 

(68) H[1] = ~J CW^v, 

where C is any convex function. It can be shown that the if -functions H[f] calculated with 
the coarse-grained distribution function increase during violent relaxation in the sense that 
H[f(r, v, t)] > H[f(r, v, 0)] for t > where it is assumed that, initially, the system is not mixed 
so that /(r, v, 0) = /(r, v, 0). This is similar to the if-theorem in kinetic theory. However, 
contrary to the Boltzmann equation, the Vlasov equation does not single out a unique functional 
(the above inequality is true for all if -functions) and the time evolution of the fi-functions is 
not necessarily monotonic (nothing is implied concerning the relative values of H(t) and H(t') 
for t,t' > 0). Yet, this observation suggests a notion of generalized selective decay principle: 
among all invariants of the collisionless dynamics, the if-functions (fragile constraints) tend 
to increase (-H decrease) on the coarse-grained scale while the mass and the energy (robust 
constraints) are approximately conserved. According to this phenomenological principle, we 
might expect (see however the last paragraph of this section) that the metaequilibrium state 
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reached by the system as a result of incomplete violent relaxation will maximize a certain 
//-function (non-universal) at fixed mass and energy. Repeating the calculations of Sec. 13.31 
with H[f] instead of S[f], the extremization of a //-function at fixed E and M determines 
a distribution function / = /(e) with / (e) < which is a stationary solution of the Vlasov 
equation (recall that our argument applies to the coarse-grained distribution). Moreover, if 
the DF maximizes the //-function at fixed E and M, then it is nonlinearly dynamically stable 
with respect to the Vlasov- Poisson system [2*01 IT"""! I"""""] 2 . In general, the //-function H*[f] 
that is effectively maximized by the system as a result of incomplete violent relaxation (if 
any) is difficult to predict jOJ. It depends on the initial conditions (due to the Casimirs) and 
on the efficiency of mixing. If mixing is complete (as may be the case for systems others 
than gravitational ones), the //-function that is maximized at equilibrium is the generalized 
entropy (["""*"[) . hence H*[f] = S[f], and the stationary distribution function is the Lynden-Bell 
distribution (|31|). If mixing is incomplete, H*[f] and /(e) can take forms that are not compatible 
with the expressions (["""""[) and ()31|) derived in the statistical approach. 
In the context of incomplete violent relaxation, the Tsallis functional 

(69) Sq [f] = -^- f(f q - f)d 3 rd 3 v, 

is a particular //-function whose maximization at fixed mass and energy leads to distribution 
functions of the form 



(70) /r,v = fj, e 

Q 

These distribution functions characterize stellar polytropes [25J. They are particular station- 
ary solutions of the Vlasov equation. For q > 1, the polytropic distribution functions have 
a compact support (they vanish at a maximum energy e max ) unlike the Lynden-Bell distribu- 
tion functions (|31|) whose tails extend to infinity. Stellar polytropes with index n < 5 (where 
n — 3/2 + l/(g — 1)) describe confined structures with finite mass, unlike isothermal stellar 
systems. They have been studied for a long time in astrophysics as simple mathematical models 
of stellar systems. Unfortunately, pure polytropic distributions do not provide a good model 
of incomplete violent relaxation for elliptical galaxies [22] ■ An improved model is a composite 
model that is isothermal in the core (justified by Lynden-Bell's theory of violent relaxation) and 
polytropic in the halo (due to incomplete relaxation) with an index n = 4 [331 EH! • Since the 
maximization principle determining the nonlinear dynamical stability of a collisionless stellar 
system (maximization of a //-function at fixed mass and energy) is similar to the maximiza- 
tion principle determining the thermodynamical stability of a collisional stellar system (max- 
imization of the Boltzmann entropy at fixed mass and energy) we can use a thermodynamical 
analogy and develop an effective thermodynamical formalism (E.T.F.) to analyze the nonlinear 
dynamical stability of collisionless stellar systems [0JE3[^]- We emphasize, however, that the 
maximization of a //-function at fixed mass and energy is a condition of nonlinear dynamical 
stability for the Vlasov equation, not a condition of thermodynamical stability. Therefore, this 
thermodynamical analogy is purely formal. In particular, in the context of violent relaxation, 
Tsallis functional S q [f] is a particular //-function, not an entropy. 

If we were to apply Tsallis generalized thermodynamics in the context of violent relaxation, 

2 During mixing Df/Dt ^ and the ff-functions H[f] increase. Once it has mixed Df/Dt = so that 
H[f] = 0. Since /(r,v, t) has been brought to a maximum / (r,v) of a certain iJ-function and since H[f) is 
conserved (after mixing), then f is a nonlinearly dynamically stable steady state of the Vlasov equation. 
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we would need to replace the Lynden-Bell entropy by the g-entropy 
(71) S q [p] = --^-j J (p*(r, v, 77) - p(r, v, r]))d 3 rd\dr,, 

as argued in [33]. The generalized mixing entropy S q [p], which is a functional of the proba- 
bility p(r, v, 77), would be the proper form of g-entropy in that context, taking into account 
the specificities of the collisionless dynamics. For q — > 1, it returns the Lynden-Bell entropy 
f[26[) . For q ^ 1, it could take into account incomplete mixing and non-ergodicity. In that 
context, the q parameter could be interpreted as a measure of mixing and Tsallis entropy could 
be interpreted as a functional attempting to take into account non-ergodicity in the process 
of incomplete violent relaxation. Maximizing S q [p] at fixed mass, energy and Casimirs, we 
obtain a (/-generalization of the Gibbs state (|28|). This maximization principle is a condition 
of thermodynamical stability (in Tsallis generalized sense) in the context of violent relaxation. 
Then, we can obtain a g-generalization of the equilibrium coarse-grained distribution function 
(p!T]) in a fashion similar to that of Sec. 13.21 after introducing proper averaging procedures 
(e.g., g-expectation values). For appropriate values of q, these distribution functions will have 
finite mass contrary to Lynden-Bell's distribution. We shall not try, however, to develop this 
generalized formalism in more detail here. Note that in the case of two levels / G {0,770}, and 
in the dilute limit of the theory f <^r) , S q [p] can be written in terms of the coarse-grained dis- 
tribution / = pT/o in the form S q [f] = — -^1 /[(/ /Vo) q ~ (f /Vo)]d 3 rd 3 v. In this particular limit, 
Tsallis functional S q [f] could be interpreted as a generalized entropy (not just a if-function). 
Therefore, Tsallis functional S q [p] expressed in terms of p(r, v, 77) is a generalized entropy while 
Tsallis functional S q \f] expressed in terms of /(r, v) is either a if-function (dynamics) or a par- 
ticular case of entropy S q [p] (thermodynamics) for two levels in the dilute limit. However, it is 
not clear why complicated effects of non-ergodicity (incomplete mixing) could be encapsulated 
in a simple functional such as (J71j) . Indeed, other functionals of the form S = — j C(p)dr]drd\ 
where C is convex could be considered as well. As discussed above, the observations of galaxies 
do not support the prediction of non- extensive thermodynamics obtained by maximizing Tsallis 
q- entropy J7i| ). Furthermore, it is not clear whether the idea of changing the form of entropy 
in case of incomplete relaxation is the most relevant. An alternative approach developed in 
fHJE2] is to keep the Lynden-Bell entropy (|2T)j) unchanged but describe the dynamical evolution 
of p(r, v, t) by a relaxation equation of the form 



with a diffusion coefficient D(r, v,t) going to zero for large time (as the variations of the 
gravitational potential $ decay) and in regions of phase-space where the fluctuations 5$ are not 
strong enough to provide efficient mixing. The vanishing of the diffusion coefficient can "freeze" 
the system in a subdomain of phase space and account for incomplete relaxation and non- 
ergodicity. In general, the resulting state, although incompletely mixed, is not a g-distribution. 
This approach in interesting because it is not based on a generalized entropy, so there is no free 
parameter like q or C(p). However, it demands to solve a dynamical equation (|72*|) to predict 
the equilibrium state. The idea is that, in case of incomplete relaxation (non-ergodicity), the 
prediction of the equilibrium state is impossible without considering the dynamics. 

We would like to emphasize again the distinction between entropies and if-functions. An 
entropy is a quantity which is proportional to the logarithm of the disorder, where the dis- 
order is equal to the number of microstates consistent with a given macrostate. This is how 
the Lynden-Bell entropy (j2HJ) has been defined. Tsallis entropy (J7TJ) could be considered as a 
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generalization of this definition in the case where the phase-space has a complex structure so 
that the evolution is non-ergodic. In each case, the entropy is a functional of the probability 
p(r, v, if) and the maximization of these entropies at fixed mass, energy and Casimirs is a con- 
dition of thermodynamical stability. The if-functions do not have a statistical origin. They are 
just arbitrary functionals of the coarse-grained distribution /(r, v, t) of the form (|68|). They are 
useful to characterize the degree of mixing of a collisionless stellar system j2D]- Furthermore, 
their maximization at fixed mass and energy provides a condition of nonlinear dynamical sta- 
bility with respect to the Vlasov equation. Finally, the "generalized entropies" (J3*7)l defined in 
Sec. 13 .31 can be regarded as entropies which are proportional to the logarithm of the number of 
microstates consistent both with a given macrostate and with the constraints imposed by the 
Vlasov equation (Casimirs). Their functional form depends on the initial condition. They are 
defined on a projection space (/-space) where a macrostate is defined by the specification of 
/(r, v) instead of p(r, v, 77). 

Finally, we note that the maximization of the Lynden-Bell entropy (|26jh of the Tsallis 
entropy (J7TJ) or of a H-function (|55|l leads to a distribution function of the form / = /(e) 
with /(e) < depending only on the energy. These DF can only describe spherical stellar 
systems (and even a sub-class of them) [23]. In reality, stellar systems are not spherical and 
their distribution functions are not function of the energy alone. Indeed, according to the 
Jeans theorem , there exists more general stationary solutions of the Vlasov equation which 
depend on other integrals of motion. This indicates that the structure of the final state of 
a collisionless stellar system depends on its dynamical evolution in a complicated manner. 
An important problem in astrophysics is therefore to find the form of distribution function 
appropriate to real galaxies. Simple concepts based on entropies and if-functions are not 
sufficient to understand the structure of galaxies. This is particularly deceptive. However, 
conceptually, the theory of violent relaxation is important to explain how a collisionless stellar 
system reaches a steady state. This is due to phase mixing in phase space. The coarse-grained 
DF /(r, v, t) reaches a steady state /(r, v) in a few dynamical times while the fine-grained 
distribution function /(r, v, t) develops filaments at smaller and smaller scales and is never 
steady (presumably). Since this mixing process is very complex, the resulting structure /(r, v) 
should be extremely robust and should be therefore a nonlinearly dynamically stable stationary 
solution of the Vlasov equation. Thus, the theory of incomplete violent relaxation explains how 
collisionless stellar systems can be trapped in nonlinearly dynamically stable stationary solutions 
of the Vlasov equation on the coarse-grained scale. 

4 Two-dimensional turbulence 

4.1 Statistical mechanics of 2D vortices 

The same ideas apply in 2D turbulence to understand the formation of coherent structures 
(jets and vortices) in large-scale flows. The analogy between stellar systems and 2D vortices 
is discussed in Chavanis [T7j. A statistical theory of point vortices has been first developed by 
Onsager and Joyce & Montgomery This theory predicts the statistical equilibrium 
state of a point vortex gas, reached for t — > +00 after a "collisional" relaxation, assuming 
ergodicity. The most probable vorticity profile is given by 



which is similar to the statistical distribution (|1U|) of a multi-components system of stars (note 
that the vorticity is proportional to the density of point vortices). A kinetic theory of point 



(73) 
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vortices has been developed by Dubin & O'Neil |37| and Chavanis |38| I17j. The collision term 
of the derived kinetic equation, which is the counterpart of the Landau equation (|13|). cancels 
out when the profile of angular momentum is monotonic so that this equation (valid to order 
1/iV) does not relax towards the statistical equilibrium state. This implies that the relaxation 
time scale (if there is ever relaxation) is larger than Ntr>. 

In the limit iV — > +00, the evolution of the system is described by the 2D Euler equation 
(jl00|) which is the counterpart of the Vlasov equation (|T3j) . The statistical mechanics of con- 
tinuous vorticity fields described by the 2D Euler equation has been developed by Miller and 
Robert & Sommeria [T^]. This is similar to the theory of violent relaxation of Lynden-Bell 
^21 El]- In that context, we speak of "inviscid relaxation" or "chaotic mixing". The mixing 
entropy is 

(74) S[p} = - J p(r, a) In p(r, a)d 2 rda, 
and the Gibbs state reads 

(75) p{r,a) = ^ X ^)e-^* +a \ 

with notations similar to those of Sec. 13.21 (here, a labels the vorticity levels). The density 
probability p(r, a) gives the local distribution of vorticity at statistical equilibrium. It maxi- 
mizes the mixing entropy (f71j) at fixed energy E = ~ J ZJipcPr, circulation T = f ujcPy (robust 
constraints) and Casimir constraints or fine-grained moments = j u n d 2 r = J pa n dad 2 r 

(fragile constraints). The partition function can be written 

r+00 

(76) Z= / X (vK am+a) da, 



and the most probable coarse-grained vorticity uj = f pada is related to the stream function 
by a relation of the form 

(77) u = -±^ = F(M + a) = m. 

This is a steady state of the 2D Euler equation where / is monotonic (since f'(ip) = —j3uj2 
with 0J2 = uj 2 — uj 2 > 0, it is increasing at negative temperatures and decreasing at positive 
temperatures). Note that the vorticity levels a can take positive and negative values contrary 
to the case of self-gravitating systems for which rj > 0. Note also that uj is a vorticity field 
not a distribution of particles, unlike / in astrophysics (only in the point vortex model can we 
interprete u as a distribution of particles since it is related to the density of point vortices). 
The most probable coarse-grained vorticity (fTTj) maximizes a generalized entropy 

(78) Sp] = - J C(ZJ)d 2 r, 

at fixed circulation and energy. Indeed, this optimization problem leads to a relation of the 
form 

(79) C'ip) = -I3tp - a, 

which can be identified with Eq. (J77)) with f'(ip) = —/3/C"(uj). This identification relates the 
function C(p) to the function F(x) whose form depends on xi. a ) through Eqs. (JTUJ) and (|77|) . 
Explicitly, we have 



(80) C{u) = - / F-\x)dx. 



-1/ 
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We can also introduce a notion of generalized selective decay principle in 2D turbulence: 
among all inviscid invariants of the 2D Euler equation, the if -functions (fragile constraints) 
H[uj] = — J C(uJ)d 2 r increase (—if decrease) on the coarse-grained scale or in the presence 
of a small viscosity (Appendix EJ while the energy E[uJ] and the circulation r[uJ] (robust 
constraints) are approximately conserved. Therefore, the metaequilibrium state resulting from 
violent relaxation is expected to maximize a certain if -function (non-universal) at fixed energy 
and circulation. This generalizes the usual selective decay principle of 2D turbulence which 
considers the minimization of enstrophy T% = f uj 2 d 2 r at fixed energy and circulation. In 
our approach, minus the enstrophy — r 2 [cJ] = — J ZJ 2 d 2 r and the Tsallis functional S q [u] = 
— -^n f(^ q ~ ui)d 2 r are particular if -functions (note that the enstrophy T2 is a particular case 
of Tsallis functional with q = 2). 

The extremization of a if -function at fixed energy and circulation leads to a stationary 
solution of the 2D Euler equation of the form To = f(ip) where / is a monotonic function 
specified by the convex function C(u). Furthermore, as shown in the condition of maximum 
provides a refined criterion of nonlinear dynamical stability for the 2D Euler-Poisson system 
(the physical interpretation of this criterion applying to the coarse-grained vorticity is the same 
as in the remark of Sec. 13 .5|) . Note that contrary to the Vlasov equation, the relation uj = f(ip) 
is the general form of stationary solution of the 2D Euler equation (for systems with no special 
symmetries) . Therefore, in 2D hydrodynamics, any nonlinearly dynamically stable stationary 
solution of the 2D Euler equation maximizes a if-function at fixed circulation and energy 
(and, possibly, angular momentum and impulse) contrary to the case of the Vlasov equation in 
astrophysics where a more general class of steady solutions exists due to the Jeans theorem. 

Finally, the Tsallis entropy in the context of the 2D Euler equation is a functional of the 
vorticity distribution p(r,o~) of the form S g [p] = — ^zj J(p q — p)dad 2 r generalizing the mixing 
entropy (J73)l [34J. This functional could be an attempt to take into account non-ergodicity in 
the process of violent relaxation of 2D turbulent flows. However, other functionals could be 
considered as well, and Tsallis entropy does not provide a correct description of non-ergodicity in 
all observed cases. This means that the type of mixing in 2D turbulence (and stellar dynamics) 
is more complex than the one (multi-fractal) described by the Tsallis functional [3 . Non- 
ergodicity (incomplete relaxation) can be taken into account dynamically by using relaxation 
equations with a space dependent diffusion coefficient related to the fluctuations jHHl EE] • 

4.2 Prior vorticity distribution 

The statistical approach of Miller-Robert-Sommeria applies to flows that are strictly described 
by the 2D Euler equation. In this point of view, one must conserve the value of all the Casimir 
invariants (or vorticity moments). This leads to the expression (J7HJ) for the most probable 
distribution of vorticity, where the function x( a ) is determined by the initial conditions through 
the value of the Casimir integrals (this is precisely the Lagrange multiplier associated to these 
constraints). However, in geophysics, there exists situations in which the flow is continuously 
forced at small-scales so that the conservation of the Casimirs is destroyed. Ellis et al. JH] 
have proposed to take into account these situations by fixing the function x{ a ) instead of the 
Casimirs. Physically, this prior vorticity distribution can be viewed as a global distribution of 
vorticity imposed by a small-scale forcing. It can be due to convection and 3D effects like in 
the atmosphere of Jupiter. Its specific form has to be adapted to the situation. Then, two- 
dimensional turbulence organizes this global distribution of vorticity into large-scale coherent 
structures. These organized states result from a balance between entropic and energetic effects: 
the system tends to mix but complete mixing, which would result in a uniform distribution, is 
prevented by the energy constraint. The most probable local distribution of vorticity is now 
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obtained by maximizing a relative entropy conditioned by the prior distribution 



(81) 



S xlP\ = ~ P(r,ff)ln 




at fixed circulation and energy (no other constraints). The conservation of the Casimirs has 
been replaced by the specification of a prior distribution x( a )- As shown in Chavanis ^J], the 
relative entropy ffHTj) can be seen as a Legendre transform S x = S — J2 n >i a n^n 9 ' °f the mixing 
entropy (fflj) when the constraints associated with the conservation of the vorticity moments 
(Casimirs) are treated canonically. Indeed, the approach of Ellis et al. [18] amounts to fixing 
the conjugate variables a n> \ instead of the fine-grained moments T^\. If we view the vorticity 
levels as species of particles, this is equivalent to fixing the chemical potentials instead of the 
total number of particles in each species. This assumes that the 2D system is in contact with 
a sort of "reservoir". The forcing and dissipation break the conservation of the Casimirs and 
impose instead a distribution of vorticity. By contrast, the robust constraints (circulation and 
energy) are still treated microcanonically. The maximization of S x at fixed E, V again leads to 
the distribution (|75|1 but with a different interpretation. In the present context, the statistical 
equilibrium state results from an interplay between 3D effects (the non-universal small-scale 
homogeneous forcing encapsulated in the prior x( a )) an d 2D effects (the universal Gibbs factor 
e -a{a+/3tp) gj v j n g r j se t inhomogeneous large-scale structures). The statistical distribution is 
the product of these two effects. The partition function and the most probable coarse-grained 
vorticity field are still given by Eqs. (J75J) and (fTTj) . However, in this new approach, the function 
F(x) is fixed directly by the prior vorticity distribution x(°~) while in the approach of Miller- 
Robert-Sommeria, it has to be related a posteriori to the initial conditions in a complicated 
way. 

The approach of Ellis et al. [18] is very close to the notion of superstatistics since it considers 
that the fluctuations of vorticity x(°") are given a priori by an external process, which is also 
the case for the fluctuations of temperature f(/3) in the Beck-Cohen superstatistics. Therefore, 
the uj — ip relationship and the generalized entropy S\uj] are directly determined by the prior 
vorticity distribution x(°~) through the formula 



where x(E) — /_ °° x{ a ) e ~° E do~ , according to Eqs. (|8U|). (J77|) and (J7HJ). This makes the 
generalized entropy S[uj] an intrinsic quantity. In the present context, it is determined by the 
small-scale forcing (through the prior x) while in the approach of Miller-Robert-Sommeria it 
depends on the initial conditions (through the Casimirs). Furthermore, in the present context, 
S[uJ] really has the status of an entropy in the sense of the large deviation theory. Indeed, 
Ellis et al. [18 j show that the probability of the coarse-grained vorticity field uJ(r) at statistical 
equilibrium can be written in the form of the Cramer formula 



where n is the number of sites of the underlying lattice introduced in their mathematical 
analysis. Therefore, the most probable vorticity field ZJ maximizes S\uJ] at fixed circulation and 
energy. This maximization principle also provides a refined condition of nonlinear dynamical 
stability with respect to the 2D Euler-Poisson system 18] . 
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4.3 Example of generalized entropy 



Let us consider, for illustration, the prior vorticity distribution x(cr) introduced by Ellis et al. 
in their model of jovian vortices. It corresponds to a de-centered Gamma distribution 



(84) 



-R 



a 



where R(z; a) = T(a) z e~ z for z > and R = otherwise. The scaling of x( a ) is chosen 
such that (a) = 0, var(cx) = 1 and skew(cr) = 2e. This distribution is a variant of Gamma 
distribution considered by Beck & Cohen [TU]. Setting E = f3ip + a, we get 



Z(E) = x{E) 



-QnZ)'(E) 



(85) 
and 

(86) ZJ(E) = 
Inversing the relation ()8f)jl . we obtain 

(87) —E 

After integration, we obtain the generalized entropy 

1 



[1 + eE) 1 /' 
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1 + eE 



cm. 



C(uJ) 



LO 



ln(l + ecu) 



This form of entropy can also be obtained from the techniques of the large deviation theory as 
discussed in [IB]. Our approach, leading to the general formula (JH2J), is a simple alternative to 
obtain the generalized entropy C(uJ) associated to the prior vorticity distribution x( a )- O n the 
other hand, for a Gaussian prior distribution 
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we get 
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Therefore, the U — ip relationship is linear and the generalized entropy S[u] = — | J ZJ 2 d 2 r is 
minus the enstrophy. It also corresponds to the limit of Eq. ()88|) for e — >• 0. Other examples of 
prior vorticity distributions are collected in [6 . An example which has not been given previously 
is when x{E) is of the Tsallis form 



(91) 
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-oo, we recover the Gaussian distribution ([8*9^1. For the distribution (|9*T|) . we get 

z{E) = 2( 3+2 ^/V 5 - 2p)/4 v / ^r(j9)| J E;r 1 / 2 " p / 1/2+?3 ( v ^| J E;|). 
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4.4 Generalized Fokker-Planck equations 

In the context of freely evolving 2D turbulence, a thermodynamical parametrization of the 2D 
Euler equation has been proposed by Robert & Sommeria [35] in terms of relaxation equations 
based on a maximum entropy production principle (MEPP). These equations conserve all the 
Casimirs, increase the mixing entropy (|74|) and relax towards the Gibbs state (175)) . In the 
situations considered by Ellis et al. JH] where the system is forced at small scale, we have 
proposed in J5] & n alternative parametrization of the 2D Euler equation. In that case, we 
have seen that only the energy and the circulation (robust constraints) are conserved. The 
conservation of the Casimirs is replaced by the specification of a prior vorticity distribution 
x(c) encoding the small-scale forcing. This fixes a form of generalized entropy (|78|) through 
the formula (JBU)- In that case, we have proposed to describe the large-scale evolution of the 
flow on the coarse-grained scale by a relaxation equation which conserves energy and circulation 
and increases the generalized entropy ()78|) until the equilibrium state f)77j) is reached. This can 
be obtained by using a generalized Maximum Entropy Production Principle. The resulting 
relaxation equation, introduced in jU], has the form of a generalized Fokker-Planck equation 



(93) ^ + u ■ Voo = V ■ { D 



where the evolution of the Lagrange multiplier (3(t) accounts for the conservation of energy. 
Furthermore, the diffusion coefficient can be obtained from a kinetic model leading to D — 
Ke 2 / a/ C"(u) where e is the resolution scale and K is a constant of order unity [15] . In these 
equations, the function C{uj) is fixed by the prior distribution x(°")- These equations are ex- 
pected to be valid close to the equilibrium state in the spirit of Onsager's linear thermodynamics. 
However, they may offer a useful parametrization of 2D flows even if we are far from equilib- 
rium. Alternatively, according to the refined nonlinear dynamical stability criterion of Ellis et 
al. |TH] these relaxation equations can be used as powerful numerical algorithms to compute 
arbitrary nonlinearly dynamically stable stationary solutions of the 2D Euler-Poisson system. 
These ideas are further discussed in jT5j in relation with geophysical flows. We note that forced 
2D turbulence provides a physical situation of interest in which a rigorous notion of generalized 
thermodynamics and generalized kinetics emerges. In our formalism, all the complexity of the 
system is encapsulated in a prior distribution x(°")- We can then determine the generalized 
entropy S\uJ] by using formula ([H2~]) and substitute the result in the relaxation equation (j53~]) to 
obtain the dynamical evolution of the coarse-grained flow. The problem now amounts to finding 
the relevant prior x(°")- Of course, this depends on the situation contemplated. Furthermore, 
for a given situation, it is likely that a whole "class" of priors (or generalized entropies) will 
sensibly give the same results. In practice, one has to proceed by trying and errors to find the 
relevant "class of equivalence" adapted to the situation considered [Sj. 

As discussed previously, the prior x(cr) encodes the small-scale forcing. It is due, e.g., to 
convection (in the jovian atmosphere) or any other complicated process specific to the situation 
contemplated. It is not our goal here to develop a precise model of convection to determine 
a relevant form for x(°")- We shall rather remain at a phenomenological level and propose 
to describe the generation of vorticity fluctuations by general stochastic processes. Since the 
generating process must include a forcing and a dissipation, we consider a generalized Langevin 
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equation of the form introduced in jB]: 




(95) 



where r] (t) is a white noise and C(x) a convex function of the global distribution of vorticity. 
The corresponding (generalized) Fokker-Planck equation is 
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dt 
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Its stationary solution determines the prior vorticity distribution x( a ) through the relation 



(97) 



a 



C\x) = ~b Y 



where b = ^/D is a sort of inverse temperature. For example, when the coefficients of dissipation 
and forcing are constant, corresponding to C(x) = X^ n X an d leading to standard stochastic 
processes, the prior distribution is the Gaussian (|%9"j) leading to a generalized entropy having the 
form of minus the enstrophy S[uJ] = — r 2 [uJ] and to a linear uJ — if) relationship at equilibrium. 
However, our formalism allows to treat more general situations. Furthermore, in the preceding 
discussion, we have implicitly assumed that the prior relaxes more rapidly to its equilibrium 
value than the coarse-grained vorticity field, so that, in Eq. fl9lj|) . the generalized entropy C(lo) 
is calculated from x{°~) = x( cr )+°°)- This is probably a relevant approximation. Otherwise, 
we need to couple the two equations (|9*Hj) and (j9T?j) and determine, at each time, the function 
C(u),t) from the prior x(cx, t), using formula 



5 Conclusion 

In this paper, we have discussed some analogies between coarse-grained distribution functions 
characterizing statistical equilibrium states of collisionless stellar systems or inviscid 2D flows 
and the notion of superstatistics introduced by Beck & Cohen (2003). In particular, we have 
shown that the coarse-grained distribution functions arising in theories of violent relaxation 
can be viewed as forms of superstatistics (albeit different from the Beck-Cohen superstatistics). 
Although the concept of violent relaxation has been introduced by Lynden-Bell (1967) long ago, 
it remains largely unknown in the statistical mechanics community and this is why we have 
exposed this theory in some detail here. Non-standard distributions arise on the coarse-grained 
scale because they are expressed as averages of fine-grained distributions. The observed (coarse- 
grained) distribution function appears to be a superposition of Boltzmann's factors weighted by 
a non-universal function x{v) or x{°)- To each coarse-grained distribution, we can associate a 
generalized entropy. For freely evolving systems, the functions xiv) or x( cr ) an d the generalized 
entropies S[f] or S\uJ] depend on the initial conditions. Alternatively, in certain occasions, it 
may be justified to regard the function x as imposed by some external processes. This prior 
distribution then directly determines the generalized entropy. This approach is particularly 
relevant in the case of geophysical flows that are forced at small scales ^Sl CHI • ^ may also 
be valid in the case of dark matter models in astrophysics were a small-scale forcing can alter 
the conservation of the Casimirs and impose instead a distribution of fluctuations. In these 
cases, the relaxation of the coarse-grained field can be described by generalized Fokker-Planck 
equations where the entropy is determined by the prior x [01 [HI] • Alternatively, these relaxation 
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equations can be used as numerical algorithms to construct arbitrary nonlinearly dynamically 
stable stationary solutions of the Vlasov and Euler equations specified by a convex function C . 

We have also discussed the two successive equilibrium states achieved by a stellar system. In 
a first regime, the evolution is collisionless and the system reaches a metaequilibrium state as a 
result of violent relaxation. This is a nonlinearly dynamically stable stationary solution of the 
Vlasov-Poisson system. On longer timescales, stellar encounters ( "collisions" ) drive the system 
towards the statistical equilibrium state described by the Boltzmann distribution (when the 
escape of stars and the gravothermal catastrophe are prevented) . The metaequilibrium state 
(collisionless regime) and the statistical equilibrium state (collisional regime) correspond to 
quite different processes. They can be written as a superposition of Boltzmann factors for 
each species of particles (collisional equilibrium) or for the different phase levels (collisionless 
equilibrium) . 

In fact, violent relaxation is incomplete in general. A famous example of incomplete relax- 
ation in 2D turbulence is provided by the plasma experiment of Huang & Driscoll [40] . In this 
experiment, the metaequilibrium state resulting from violent relaxation has the form of a self- 
confined vortex surrounded by un-mixed flow. This strong confinement is in contradiction with 
the statistical mechanics of Miller-Robert-Sommeria ^5] which leads to un-restricted vorticity 
profiles. As discussed in Brands et al. [34J, the observed confinement is due to incomplete 
relaxation and lack of mixing/ergodicity. The system has evolved to a stationary solution of 
the 2D Euler equation which is not the most mixed state. Now, any nonlinearly dynamically 
stable stationary solution of the 2D Euler equation maximizes a if-function S[cJ] at fixed cir- 
culation and energy. In the special case considered by Huang & Driscoll, this if -function turns 
out to be related to the enstrophy functional T 2 [cJ], which is a particular form of the Tsallis 
H -function S q \uJ] with q = 2. This "dynamical interpretation" based on if-functions is different 
from the "generalized thermodynamical interpretation" of Boghosian [IT] where S q \uJ] is viewed 
as a Tsallis q- entropy. Since (in our sense) the Tsallis functional S q \uJ] is a if -function, not an 
entropy, the use of g-expectation values is irrelevant in this dynamical context. If we want to 
apply Tsallis thermodynamics in the context of the 2D Euler equation, we need to introduce 
an entropy S q [p] which is a functional of the probability density p(r, a). However, in that case, 
the agreement with the plasma experiment fails as shown in [33]. Therefore, the experimental 
result of Huang & Driscoll cannot in fact be explained by Tsallis generalized thermodynamics 
when the full constraints of the Euler equation are accounted for. The fact that the uJ — i[) 
relationship resembles a g-distribution (in cJ-space) is coincidental. This is a particular solution 
of the Euler equation resulting from incomplete violent relaxation. Since the 2D Euler equation 
admits an infinity of stationary solutions, there are many other examples of incomplete violent 
relaxation in 2D turbulence (and stellar dynamics) where the system settles in a steady state 
that is not described by the Tsallis distribution (in aJ-space or in p-space). The situation de- 
scribed by Huang & Driscoll in which an ZJ — ip relationship resembling a g-distribution emerges 
is fortuitous and not generic. 

In this paper, we have tried to distinguish different notions of entropy that arise in the the- 
ory of violent relaxation. The mixing entropy (}26|) (|71*|l is the fundamental entropy of the theory. 
It can be obtained by a combinatorial analysis and its maximization at fixed mass/circulation, 
energy and Casimir invariants determines the most probable distribution of fine-grained lev- 
els p(r, v,?]) through the Gibbs state (|25 |l (J75j) . assuming ergodicity (complete mixing). The 
generalized mixing entropy (j71j) is the appropriate Tsallis generalization of (|26J) in the con- 
text of violent relaxation. It can be seen as an attempt to take into account non-ergodic 
effects and describe them in terms of a single parameter q. All the machinery of non-extensive 
thermodynamics (g-expectation values,...) could be developed in that framework, working 
with p(r,v,i]) instead of /(r,v). We might also consider other generalizations of entropy 
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S = — J C(p)d 3 rd 3 vdi] where C is convex. The status of such generalizations is still in debate 
for the moment because it is not clear whether non-ergodic effects can be encapsulated in a 
simple functional. One must rather accept that the final state of the system is unpredictable in 
case of incomplete violent relaxation. The relative entropy (18 1|) is the Legendre transform of the 
mixing entropy (f74"|) conditioned by a prior vorticity distribution %(cr) in the sense of [T%1 IP3]. 
This description can be relevant for 2D turbulent flows that are forced at small-scales. Its max- 
imization at fixed circulation and energy (no other invariants) determines the most probable 
distribution of fine-grained levels p(r, a) through the Gibbs state (fTo*)) conditioned by an im- 
posed global distribution The generalized entropy (|H7j) - (pSj) or (fTHjl - lfHUj) is the functional 
that the most probable coarse-grained distribution f(r, v) or uJ(r) given by (|HU|) - (j77|) maximizes 
at fixed energy and mass/circulation. For freely evolving systems, it depends on the initial con- 
ditions. For forced systems, it is determined by the prior vorticity distribution through 
the formula ()82|). The H -functions ()68|) are arbitrary functionals (not entropies) of the coarse- 
grained field. They increase during mixing and their maximization at fixed mass/circulation 
and energy determines a nonlinearly dynamically stable stationary solution of the Vlasov/Euler 
equation with a monotonic relationship / = /(e) or w = These stationary solutions can 
result from complete or incomplete violent relaxation (in that case, / and u must be regarded 
as the coarse-grained fields). When mixing is complete, the if-function that is maximized at 
equilibrium is the generalized entropy ([37 )1 - ([43 )1 or ([78 )1 - ([80| ) . When mixing is incomplete, the 
^/-functions and the coarse-grained distributions can take forms that are not consistent with 
the statistical theory. For example, Tsallis functional (J69|) is a particular i?-function associ- 
ated with stellar polytropes and polytropic vortices. They form simple families of stationary 
solutions of the Vlasov and 2D Euler equations. They sometimes arise as a result of incom- 
plete violent relaxation due to the combined effect of Casimir constraints and non-ergodicity 
|4U[ I34j . The maximization of a if -function at fixed mass/circulation and energy is a condition 
of nonlinear dynamical stability. We can develop a thermodynamical analogy and an effective 
thermodynamical formalism to study the nonlinear dynamical stability of the system, but the 
notion of "generalized thermodynamics" is essentially effective in that context [HJ . 

In conclusion, a striking property of systems with long-range interactions is the rapid emer- 
gence of coherent structures: galaxies in astrophysics, vortices and jets in 2D turbulence, quasi- 
equilibrium states in the HMF model... Since these metaequilibrium states are not described 
by the Boltzmann distribution, some authors have proposed to replace the Boltzmann entropy 
Sslf] by the Tsallis entropy S g [f], invoking that the system is non-extensive so that standard 
statistical mechanics is not applicable |4"T) |4"2*1 03] . However, this approach ignores the impor- 
tance of the Vlasov equation and the concept of violent relaxation introduced by Lynden-Bell 
|12j . The description of coherent structures in Vlasov systems is complicated but it can be ex- 
plained in terms of "classical" principles without invoking a generalized thermodynamics [T7] . 
Our discussion indicates that there are two independent reasons why the quasi-equilibrium 
states that form as a result of violent relaxation are non-Boltzmannian. This is due, on the one 
hand, to the existence of fine-grained constraints (the Casimirs) which depend on the initial 
conditions and, on the other hand, to incomplete relaxation (non-ergodicity, partial mixing). 
Even in case of ergodicity (complete mixing), we can have a wide diversity of non-standard 
distributions depending on the initial conditions. They are given by Eq. (J3(Jj) according to 
the statistical theory of Lynden-Bell. They are sorts of superstatistics. Moreover, if the sys- 
tem does not mix efficiently, the Lynden-Bell prediction breaks down and even more general 
distributions can be observed. They are stable stationary solutions of the Vlasov equation on 
the coarse-grained scale. The prediction of the metaequilibrium state in case of incomplete 
relaxation is extremely complicated, if not impossible. One possibility is to change the form 
of entropy. However, the metaequilibrium state cannot apparently be described by a universal 
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functional such as the Tsallis functional, even if it is extended to the form (|71|) so as to take into 
account the specificities of the collisionless evolution (Casimir constraints). An alternative ap- 
proach is to keep the Lynden-Bell entropy but develop a dynamical theory of violent relaxation 
as initiated in jTHl to understand what prevents complete mixing. In that case, we have to 
solve a dynamical equation with a non-constant diffusion coefficient related to the fluctuations. 
The if-functions can also be useful to construct stable models of galaxies (and 2D vortices) 
in order to reproduce observed phenomena. In some specific situations, some if-functions (be- 
longing to the same "class of equivalence") may be more appropriate than others to describe 
the system, so that a phenomenological notion of "effective generalized thermodynamics" (in 
/-space or cJ-space) can be developed to deal with complex systems in a simple and practical 
way [Oj. In that point of view, the relevant functional should be found by trying and errors. 



A //-functions for the 2D Euler equation 

We briefly recall, and adapt to the case of the 2D Euler equation, the notion of //-functions 
introduced by Tremaine et al. for the Vlasov equation. These concepts have not been 
introduced in 2D turbulence. A if-function is a functional of the coarse-grained vorticity of 
the form 

(98) H = - J C(ZJ)dh, 

where C is a convex function. We assume that the initial condition at t — has been prepared 
without small-scale structure so that the fine-grained and coarse-grained vorticity fields are 
equal: aj(r, 0) = uu(r, 0). For t > 0, the system will mix in a complicated manner and develop 
intermingled filaments so that these two fields will not be equal anymore. We have 

H(t)-H(0) = J {C[uJ(r,0)] -Cp(r,t)}}d 2 r 

(99) = J{C[u(r,0)]-C[uJ(v,t)]}d 2 v. 

The fine-grained vorticity is solution of the 2D Euler equation 

du; „ 

100 — + u-Vcu = 0, 

at 

where u(r, t) is an incompressible velocity field. Thus 

j t J C{uj)d 2 v = J C\u)^d 2 v = - J C'(lu)u ■ Vujdh 

(101) = - J u ■ VC(uj)d 2 r = - J V(C(uj)u)d 2 r = 0. 

This shows that the if-function H[u] calculated with the fine-grained vorticity is independent 
on time (it is a particular Casimir) so Eq. (}9"9"j) becomes 

(102) H{t) - H(0) = J {C[co(r, t)} - C[lJ(y, t)]}d 2 r. 

Now, a macrocell is divided into v microcells of size h = A/ v. We call Ui the value of the 
vorticity in a microcell. The contribution of a macrocell to H(t) — H(0) is 

(103) A {^E^)- G (^5>)} 
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which is positive since C is convex. Therefore, the iJ-functions calculated with the coarse- 
grained vorticity H[uj] increase in the sense that H(t) > H(0) for any t > 0. Note, however, 
that nothing is said concerning the relative value of H(t) and H(t') for t, t' > so that the 
increase is not necessarily monotonic. 

In 2D hydrodynamics, the viscosity has an effect similar to coarse-graining. Indeed, consid- 
ering the Navier-Stokes equation 

(104) ^ + u • Voo = uAlo, 
with v > 0, we get 

H = -j t J C{uo)dh = - J C\uj)^d 2 v = -v J C\uo)Auod 2 v 

(105) =u J VC\uo) ■ Vuod 2 r = v J C"{uo){Vuo) 2 d 2 r > 0. 
In that case, the increase of H is monotonic. 
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